The I COPPE Scale Short Form for measuring multidimensional well‐being: Construct validity and reliability from US, Argentinian, and Italian large samples

Abstract The aim of this study is to present a short form of the I COPPE scale of multidimensional well‐being. We conducted two studies, which include four samples collected across three countries, namely the United States, Argentina, and Italy. In the pilot study, we tested during the data analysis phase whether it was feasible to reduce the full I COPPE scale by omitting the items dealing with past well‐being. Prompted by the positive results of the pilot study, we launched a final validation study with a sample of 2682 Italian people who completed the I COPPE scale short form, which is designed without items referring to past well‐being. Results from a series of confirmatory factor analyses show that the I COPPE scale short form presents acceptable levels of construct validity and reliability. Moreover, the 7‐factor correlated‐trait model proved to be the best fit for the data. We discuss the advantages of using the I COPPE scale short form, along with limitations and future recommendations.


| INTRODUCTION
In recent years, we have witnessed a shift from unitary models of subjective well-being to well-being as a multidimensional construct (Arcidiacono & Di Martino, 2016; Sarriera & Bedin, 2017). Ryff (1989) was amongst the first to propose psychological well-being as optimal psychological functioning based on six dimensions: self-acceptance, positive relationships, autonomy, control over one's own environment, purpose in life, and the feeling of continuous personal growth. Swarbrick (2006) considers well-being a holistic construct, including multiple areas of health and functioning, such as physical and spiritual health as well as possessing an integrated personality. Diener (2000; Diener et al., 2009) defined well-being as a general experience of pleasure, satisfaction with life, absence of negative affect, and presence of positive affect. Seligman (2011) developed the PERMA model to explain how the pursuit of well-being can be achieved through a combination of positive emotions, engagement, relationships, meaning, and accomplishment. Keyes (2013) proposed the construct of social well-being as a positive state of affairs that individuals can derive from adaptation to society in terms of social integration, social acceptance, social contribution, and social coherence.
Most of the above models consider well-being from a strictly individual and psychological point of view. With some notable exceptions, such as Keyes' model of social well-being (2013), they neglect to consider how people's well-being extends to socio-cultural domains (Petrillo et al., 2015). Recently, some scholars have argued for the need to expand our understanding of the good life towards a multidimensional vision of subjective, psychological, psychosocial, and socio-community elements of well-being (Sarriera & Bedin, 2017). As shown by Linton et al. (2016), several tools have been proposed to measure multidimensional well-being. Fugl-Meyer et al. (1991), for example, developed the Life Satisfaction Questionnaire-9, which considers mental, physical, and social well-being, activities, and personal circumstances. The Personal Well-being Index-Adult (International Well-being Group, 2013) also evaluates several domains of well-being, including the often-neglected dimension of spiritual well-being. Last, the Multicultural Quality of Life Index (Mezzich et al., 2011) and the Self-Evaluated Quality of Life Questionnaire (Ventegodt et al., 2003) cover both global well-being and all the domains considered by the previous tools.
Within this context, Prilleltensky and colleagues (2012, 2015) have proposed a multidimensional model of well-being, which considers relevant life domains. This vision derives from the notion that well-being is achieved by the simultaneous fulfilment of needs at the individual, relational, organizational, and community levels (Prilleltensky, 2012). The I COPPE model (Prilleltensky et al., 2015) considers well-being across six domains: interpersonal, community, occupational, physical, psychological, and economic. To these, it adds an overall assessment of one's life, namely overall well-being. Interpersonal well-being refers to the degree of satisfaction with one's intimate relationships with family, friends, and colleagues; community well-being refers to satisfaction with the place where we live; occupational well-being refers to the level of satisfaction with one's main activity, such as work or caring for home and family; physical well-being refers to one's general state of health; psychological well-being refers to the degree of satisfaction with one's emotional life; and lastly, economic well-being refers to one's financial condition (Prilleltensky et al., 2015).
To operationalise their theory of multidimensional well-being, Prilleltensky and colleagues (2015) have developed the I COPPE scale. The I COPPE scale is a self-administered tool that measures the above-mentioned seven domains of well-being through 21 items. This tool also has the important advantage of considering well-being from a temporal perspective. In fact, each dimension of well-being is measured by three items: present, past, and future well-being. This aspect allows for a more stable evaluation of people's perception of well-being over time. To date, the I COPPE scale has been adapted in several countries and across diverse social groups. In Italy, Di Martino et al. (2018) conducted an adaptation study with a national sample, which confirmed its psychometric validity. Drawing on the results of that adaptation of the scale, which showed a lower relationship between items of the past and their corresponding domains of well-being (Di Martino et al., 2018), we decided to develop a short form of the I COPPE scale that includes only present and future items.
This decision was also supported by theoretical and empirical evidence from the scientific literature, which suggests that when it comes to people's self-evaluation of well-being, recollection of past events is more prone to errors (e.g., recollection bias, fading affect bias) than the assessment of present and future circumstances (Kahneman & Riis, 2005; Walker et al., 2003). Other studies also suggest that people's evaluation of past well-being is less reliable, since it changes depending on whether people hold a positive or negative attitude towards past events (Adler & Pansky, 2020; O'Brien et al., 2012).
Other validation studies also support this view. In fact, Gallup (2009) concluded that only items of present and future ratings offered the best reliability for measuring well-being. Pavot et al. (1998) reached similar conclusions in evaluating the Temporal Satisfaction with Life Scale. As the authors concluded: "…the addition of the past items…. did not result in a significant increment in the prediction of peer-reported SWB" (p. 349).

| Participants and procedure
The findings presented in this article are derived from two studies: a pilot study and a final validation study. The pilot study reduced the full version of the I COPPE scale from 21 items to 14 by excluding the 7 items dealing with past well-being. Its main purpose was to derive some initial results from deleting the past component of well-being from the I COPPE scale before launching a full validation study.
Due to the exploratory nature of the pilot study, we decided to avoid costly primary data collection in this phase and relied instead on secondary data. This is not an unusual practice, and it is sometimes recommended for assessing the psychometric validity and reliability of short-form instruments (Widaman et al., 2011).
The pilot study draws on data collected from three countries (i.e., United States, Italy, and Argentina), which were chosen for their diversity in terms of language, history, culture, and geography. The US sample, which was used to validate the original version of the scale, was collected in 2015 and is composed of 426 US citizens (212 males and 214 females) with a mean age of 50.86 years (SD = 13.57). For the demographic characteristics of this sample, we refer the readers to Prilleltensky and colleagues (2015).
The Italian sample was collected between April and July 2015 and was composed of 2017 Italian people (807 males and 1210 females), with a mean age of 30.528 years (SD = 11.759). The demographic characteristics of this sample are available in the Italian adaptation of the I COPPE scale (see Di Martino et al., 2018).
The Argentinian sample was collected between September and November 2018. The sample was composed of 482 Argentines (175 males and 307 females) with a mean age of about 22 years (SD = 3.78). Unlike in the United States and Italy, the Argentinian adaptation of the I COPPE scale has not yet been validated, and the data were collected on a predominantly local sample of university students living in the province of Tucuman. The participants' demographic characteristics are available in Table 1.
Prompted by the positive results obtained from the pilot study, which is described in detail below, we launched a final validation study. The final validation study is to be considered the primary investigation for the validation of the I COPPE scale short form. The study relies on primary data collected from a new sample of participants who were presented with a short form of the I COPPE scale. In this case, the I COPPE short form does not include items dealing with past well-being. The sample was collected between May and September 2019 and included 2682 Italian people (1034 males and 1648 females), with a mean age of 29.73 years (SD = 12.79). The demographic characteristics of this sample are shown in Table 1.
As can be seen from the data reported in Table 1 and in previous articles (see Di Martino et al., 2018; Prilleltensky et al., 2015), the four samples, despite being different in age and marital status, are quite homogeneous in gender. With the exception of the American sample, in which the two categories have almost the same percentage (50%), in the other three samples the proportion of women is around 60% of the total number of participants.
Data collection techniques employed to build our samples also varied. An online survey panel company contacted the American participants through email, asking them to complete an online survey (see Prilleltensky et al., 2015). Unlike the study in the United States, the studies conducted in Italy and Argentina required translating and back-translating the I COPPE scale to ensure meaningful language equivalence between culturally diverse contexts (Brislin, 1970). In addition, the Italian and Argentinian studies employed a combination of convenience sampling and snowball sampling techniques (see Di Martino et al., 2018). Participants were recruited both from the research team's contacts network and through the contacts network of a group of undergraduate university students previously involved in the research. Students were invited to find potential participants amongst their contacts, who, in turn, were asked to send the link to the questionnaire to other participants. For this purpose, the students were trained in Computer Assisted Telephone Interview methodology and overseen by senior researchers throughout the process of data collection. All participants were invited to fill in the questionnaire via the SurveyMonkey online platform, where research information and instructions on how to fill in the questionnaire were available. A Facebook group was also created to facilitate the dissemination of research materials amongst the data collectors as well as to better share the link to the online questionnaire.

| Measures
As already noted, this study employed two versions of the I COPPE scale. In its original full version, the I COPPE scale asks participants to assess their level of well-being across 7 domains, using a Cantril scale ranging from 0 (minimum) to 10 (maximum) (Prilleltensky et al., 2015). Each domain comprises three items, which evaluate well-being in the current moment (present), in the previous year (past), and in the next year (future). In the pilot study, participants from the three samples (i.e., US, Italian, and Argentinian) were instructed to complete the full version of the I COPPE scale, which includes 7 items of past well-being, 7 items of present well-being, and 7 items of future well-being. During data analysis, we omitted the items of past well-being. Henceforth we will refer to this abbreviated scale as the "I COPPE reduced pilot version". In the final validation study conducted in Italy, participants completed a shorter form of the I COPPE scale, with only 14 items: 7 of present well-being and 7 of future well-being. In this case, the items concerning the past were never presented to participants. From now on we will refer to this version as the "I COPPE short form".
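The item structure described above can be sketched in a few lines of Python. This is only an illustration: the item labels (e.g., `overall_past`) are hypothetical names chosen for this sketch, not the wording or coding used in the published scale.

```python
# Hypothetical item labels: one item per (domain, time frame) pair.
# The labels are assumptions for illustration, not the scale's own item names.
DOMAINS = ["overall", "interpersonal", "community", "occupational",
           "physical", "psychological", "economic"]
TIME_FRAMES = ["present", "past", "future"]


def build_items(time_frames):
    """Return one item label for every combination of domain and time frame."""
    return [f"{domain}_{frame}" for domain in DOMAINS for frame in time_frames]


# Full version: 7 domains x 3 time frames.
full_scale = build_items(TIME_FRAMES)

# Reduced pilot version / short form: the same items without the past time frame.
short_form = [item for item in full_scale if not item.endswith("_past")]

print(len(full_scale), len(short_form))  # prints: 21 14
```

In the pilot study this filtering happens at the analysis stage, whereas in the final validation study the past items are never shown to respondents; the resulting item set is the same in both cases.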

| Data analysis
Data were analysed by means of Mplus v.8 to calculate descriptive statistics and to implement the confirmatory factor analysis (CFA) models. Robust maximum likelihood was chosen as the main estimator, given the presence of univariate and multivariate deviation from normality. To assess model fit, we relied on conventional cut-off points (Hu & Bentler, 1999), which recommend: a p value for the χ² test higher than 0.05; Tucker-Lewis index (TLI) and comparative fit index (CFI) more than 0.95; root mean square error of approximation (RMSEA) less than 0.05; and standardized root mean square residual (SRMR) less than 0.08.
Moreover, given the well-known sensitivity of the χ² test to large sample size (Bentler & Bonett, 1980; Fornell & Larcker, 1981), we did not rely on its results. As an alternative, we included two more indices, namely Gamma hat (Fan & Sivo, 2007; 2009) and McDonald's non-centrality index (Mc; McDonald, 1989). This choice was driven by the evidence that they have proven to be robust, amongst other things, to large sample size. The literature suggests as cut-off values Gamma hat more than 0.95 and Mc more than 0.90 (Hu & Bentler, 1999).
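For readers who wish to reproduce these two indices from standard CFA output, the following is a minimal sketch based on the commonly cited formulas, Gamma hat = p / (p + 2(χ² − df)/(N − 1)) and Mc = exp(−0.5(χ² − df)/(N − 1)), where p is the number of observed variables. The function names, the truncation of the non-centrality estimate at zero, and the input values in the usage lines are assumptions made for illustration; they are not results from this study.

```python
import math


def noncentrality(chi2, df, n_obs):
    """Rescaled non-centrality estimate (chi2 - df) / (N - 1), truncated at zero."""
    return max(chi2 - df, 0.0) / (n_obs - 1)


def gamma_hat(chi2, df, n_obs, n_vars):
    """Gamma hat = p / (p + 2 * noncentrality), with p observed variables."""
    return n_vars / (n_vars + 2.0 * noncentrality(chi2, df, n_obs))


def mcdonald_nci(chi2, df, n_obs):
    """McDonald's non-centrality index = exp(-0.5 * noncentrality)."""
    return math.exp(-0.5 * noncentrality(chi2, df, n_obs))


# Hypothetical model output, used only to show the calls.
chi2, df, n_obs, n_vars = 250.0, 56, 2682, 14

g_hat = gamma_hat(chi2, df, n_obs, n_vars)
mc = mcdonald_nci(chi2, df, n_obs)

# Cut-offs quoted in the text: Gamma hat > 0.95 and Mc > 0.90.
print(f"Gamma hat = {g_hat:.3f}, acceptable: {g_hat > 0.95}")
print(f"Mc = {mc:.3f}, acceptable: {mc > 0.90}")
```

Both indices depend on the sample size only through the rescaled non-centrality term, which is why they are less affected by large samples than the raw χ² statistic.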
Missing values were treated with list-wise deletion, causing a relatively small loss of cases in nearly all instances. Only in the full validation study does the Italian sample present 261 missing cases, which amounts to 12.9% of the total sample. However, this does not pose any significant threat to the power of our main analyses. In fact, power analyses based on the RMSEA test of close fit (MacCallum et al., 1996) show that a minimum sample of about 278 respondents is sufficient to reach a recommended power of .8. Since all cases examined exceed this minimum requirement, we can be confident that our results did not incur a Type II error.

| Findings
Following the structure of the original I COPPE scale (Prilleltensky et al., 2015), we first applied a 7-factor correlated-trait model to both the I COPPE reduced pilot version (pilot study) and the I COPPE short form (full validation study). Table 3 shows that both the I COPPE reduced pilot version and the I COPPE short form present highly significant factor loadings and inter-item reliability values, with a minimum of 0.58 (R² = 0.34) for overall future well-being in the Italian sample, and a maximum of 0.99 (R² = 0.98) for occupational present well-being in the US sample.
As reported in previous studies (Di Martino et al., 2018; Prilleltensky et al., 2015), the items of present well-being show the highest loadings and inter-item reliability.
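The relationship between the reported loadings and the inter-item reliability values can be checked directly: for a standardized solution, the R² of an item is the square of its standardized loading. The short sketch below uses the minimum and maximum loadings quoted in this section; the function name is introduced here only for illustration.

```python
def inter_item_reliability(loading):
    """Squared standardized loading: item variance explained by its factor."""
    return loading ** 2


# Minimum and maximum standardized loadings quoted in the text.
for loading in (0.58, 0.99):
    r_squared = inter_item_reliability(loading)
    print(f"loading = {loading:.2f} -> R2 = {r_squared:.2f}")
# prints:
# loading = 0.58 -> R2 = 0.34
# loading = 0.99 -> R2 = 0.98
```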

| Reliability and construct validity
The reliability of the 7 factors making up the I COPPE reduced pilot version and the I COPPE short form was assessed through composite reliability (CR), which has been shown to perform better than the more commonly used Cronbach's alpha, particularly in cases of non-tau-equivalence (Raykov, 1997). Values of CR higher than 0.7 are considered a sign of good reliability.
Convergent validity was assessed through average variance extracted (AVE), which has better performance properties than the more commonly used method of Campbell and Fiske (1959), which relies on correlations between the target factors and other instruments deemed to measure similar constructs (Cheung & Wang, 2017). Values of AVE higher than 0.5 are indicative of good convergent validity. In addition, discriminant validity can be established if AVE is higher than both maximum shared squared variance (MSV) and average shared squared variance (ASV) (Hair et al., 2010). The only exception we found is for economic well-being in the I COPPE short form. However, a difference of only 0.002 points between AVE and MSV should not pose a serious threat to the discriminant validity of this factor.
These results suggest that both the I COPPE scale reduced pilot version and the I COPPE short form present acceptable levels of reliability and of convergent and discriminant validity.
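The criteria described in this section can be illustrated with a minimal sketch. It uses the textbook formulas for standardized loadings, CR = (Σλ)² / ((Σλ)² + Σ(1 − λ²)) and AVE = Σλ² / k, and takes MSV and ASV as the maximum and the mean of the squared correlations between one factor and the others. The sketch assumes uncorrelated residuals, and the loadings and correlations are hypothetical values chosen for illustration; they are not the estimates reported in the tables.

```python
def composite_reliability(loadings):
    """CR from standardized loadings, assuming uncorrelated residuals."""
    total = sum(loadings)
    error = sum(1.0 - loading ** 2 for loading in loadings)
    return total ** 2 / (total ** 2 + error)


def average_variance_extracted(loadings):
    """AVE: mean of the squared standardized loadings."""
    return sum(loading ** 2 for loading in loadings) / len(loadings)


def shared_variances(correlations):
    """Return (MSV, ASV) from one factor's correlations with the other factors."""
    squared = [r ** 2 for r in correlations]
    return max(squared), sum(squared) / len(squared)


# Hypothetical factor with two items (present and future well-being).
loadings = [0.90, 0.70]
# Hypothetical correlations with the six other factors.
correlations = [0.60, 0.45, 0.30, 0.50, 0.40, 0.35]

cr = composite_reliability(loadings)
ave = average_variance_extracted(loadings)
msv, asv = shared_variances(correlations)

print(f"CR = {cr:.3f}, reliable: {cr > 0.7}")
print(f"AVE = {ave:.3f}, convergent validity: {ave > 0.5}")
print(f"MSV = {msv:.3f}, ASV = {asv:.3f}, "
      f"discriminant validity: {ave > msv and ave > asv}")
```

When residuals are allowed to correlate, the error term in the CR formula would also need to account for the residual covariances, so values computed this way may differ from those produced by the modelling software.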

| Model comparisons
Having established the psychometric properties of both the I COPPE scale reduced pilot version and the I COPPE short form, in this last section we report the results of comparisons of the I COPPE short form against alternative CFA models, including a solution with a latent variable explaining all 14 items of well-being. A bi-factor solution was not included on this occasion because it was not possible to reach an identified solution. This condition is likely due to the complexity of the model.
When comparing nested models with large samples, the χ² difference test tends to suffer from the same shortcomings as the absolute χ² test (Brannick, 1995). In our case, given 2943 observations, it is advisable not to trust results from the χ² difference test. As an alternative, we followed Fan and Sivo's (2009)  to Model A, we will rely on differences in Bayesian information criterion (BIC) values, which are more robust to large sample size than Akaike information criterion (AIC) values. Kass and Raftery (1995) suggest the following guidelines to assess differences in BIC: between 1 and 3 = not worth mentioning, between 3 and 20 = positive, between 20 and 150 = strong, higher than 150 = very strong.
Results reported in Table 5 show that, consistent with the I COPPE scale full version, the 7-factor correlated-traits solution (Model A) presents better fit to the data than its competing models.

| DISCUSSION
Our results lend support to the hypothesis that a shorter version of the I COPPE scale presents good psychometric properties across all samples considered. First, we carried out a pilot study to test whether the I COPPE scale could be restructured into a reduced pilot version by omitting items of past well-being in the data analysis phase.
Then we launched a final validation study to test responses to the I COPPE short form, which does not include items of past well-being even in the survey. In all cases examined, we found high levels of CR and AVE, the latter exceeding, in all but one case, the values of MSV and ASV. These results suggest that a shorter version of the I COPPE scale possesses good levels of reliability as well as of convergent and discriminant validity.
In terms of parameter estimates, all congeneric variables show high factor loadings and inter-item reliability values with respect to their corresponding factors. The items of present well-being show the highest values, indicating that people are more consistent and less biased in their present evaluations when trying to assess their state of well-being.
Lastly, the I COPPE scale short form was tested against 3 competing models, including a second-order solution. As with the full version, the short form is best expressed through a 7-factor correlated-traits model, with each factor (or domain of well-being) tapping into one item of present well-being and one item of future well-being.

| LIMITATIONS AND RECOMMENDATIONS
An important limitation to consider when analysing data collected through the I COPPE scale short form is that each of the 7 latent variables that form the domains of well-being is explained by only two manifest variables. As we are aware, the literature encourages the use of more than two parameters in confirmatory factor analysis, for reasons of identification (Bollen & Davis, 2009). In fact, a limited number of parameters per factor is likely to generate improper solutions. The most common one we encountered for the I COPPE short form is the presence of Heywood cases with negative measurement error variances (Bentler & Chou, 1988). This is particularly likely to happen when data are collected from small samples (Gerbing & Anderson, 1987).
Several solutions have been proposed to deal with Heywood cases (see Chen et al., 2001; Dillon et al., 1987). In our case, we managed to avoid Heywood cases altogether by correlating the residual errors of all the items of future well-being, rather than those of present well-being. However, from a statistical point of view, we still recommend the use of the full version of the I COPPE scale whenever possible to avoid Heywood cases altogether. Whenever a justifiable choice is made for using its short form, we recommend collecting relatively large samples, carefully inspecting the residual matrix, and also adopting the strategies described by Chen and colleagues (2001).
Although the main validation of the I COPPE short form gave positive results, we recommend caution in generalising the findings beyond the Italian context. This appears even more relevant in view of some lack of homogeneity amongst the samples collected. This is especially true for the US and the Argentinian samples, where the difference between the mean age of the participants is approximately 28 years. Therefore, it is desirable that future research test the validity of the I COPPE scale short form with samples as homogeneous as possible.
One last limitation refers to the amount of missing data in the final validation study. Although we demonstrated that the reduction of nearly 13% of the sample due to the deletion of missing data does not harm the statistical power of our main tests, we should still be mindful that this could influence the generalizability of our results.

| CONCLUSIONS
The study confirms that the I COPPE scale short form, which includes only present and future well-being items, presents high validity and reliability in the measurement of well-being. This result is in line with previous analyses conducted by Myers  past, at least in some circumstances, may offer negligible empirical contributions over and above an individual's perceptions of the present and future in the practical assessment of multidimensional well-being" (p. 796).
Moreover, our findings suggest that the I COPPE scale short form could be used in different countries, given that we found similar psychometric properties across the US, Italian, and Argentinian samples. However, we should also be mindful that the participants in the pilot study, who completed the I COPPE scale reduced pilot version, were presented with a version somewhat different from the final I COPPE short form, which was validated in the final validation study. Future studies might bring further evidence of the validity of the I COPPE short form in countries other than Italy. The universality of the I COPPE scale short form would be better assessed through more suitable statistical techniques, such as multigroup invariance, as well as through a larger and more diverse set of countries.
Last, we should acknowledge some of the advantages that a short form of the I COPPE scale offers to those interested in measuring multidimensional well-being. First of all, a smaller number of items means less time is needed to complete the survey, with a consequent reduction of biases due to respondent fatigue (Ben-Nun, 2008). Furthermore, using too many items in the same survey can sometimes be counterproductive, in that the multidimensionality of a construct gets confused with the multiplicity of factors that cause it (Bowling, 2005). Instead, fewer carefully selected items help reduce the complexity of an instrument without undermining its multidimensionality, which is fundamental to acknowledging the complex nature of well-being.
In conclusion, professionals interested in the measurement of well-being can benefit from using the I COPPE short form whilst still being confident enough in the validity and reliability of their results.

PEER REVIEW
The peer review history for this article is available at https://publons.com/publon/10.1002/jcop.22659

DATA AVAILABILITY STATEMENT
Data available on request from the authors.
FIGURE 1 7-Factor correlated-trait model of I COPPE short form. N.B. All values are significant at the .1% alpha level. N.B.2 Only correlations more than 0.5 were depicted to reduce clutter. * ρc = composite reliability; ℓ2 = inter-item reliability; ε = measurement residuals